Performance Evaluation of Quantitative Metrics on Ancient Text Documents Using MIGT
Abstract
Optical Character Recognition (OCR) has a wide variety of applications in text document image analysis, where it is used to recognize the individual characters of any language. Digitizing old documents so that their content is preserved for future generations is a difficult task. In this paper we summarize different quantitative image metrics for estimating the loss of information after a noisy image has been cleaned with any one of the local or non-local thresholding techniques. The quality evaluations are made on 40 Telugu and English text documents after cleaning them with the Modified Iterative Global Threshold (MIGT) approach.
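As a rough illustration of the workflow the abstract describes (threshold a noisy document image, then score the information loss), the following minimal sketch may help. It rests on several assumptions: the thresholding shown is the classic iterative (isodata-style) global threshold, not the paper's MIGT modification, whose details are not given here; MSE and PSNR are used only as familiar examples of quantitative metrics, since the abstract does not name the metrics it evaluates; and the pixel values and function names are hypothetical.

```python
import math


def iterative_global_threshold(pixels, eps=0.5):
    """Classic iterative global threshold (assumption: NOT the paper's MIGT)."""
    t = sum(pixels) / len(pixels)  # start from the mean gray level
    while True:
        low = [p for p in pixels if p <= t]
        high = [p for p in pixels if p > t]
        if not low or not high:  # image is effectively single-valued
            return t
        # New threshold: midpoint between the two class means.
        new_t = (sum(low) / len(low) + sum(high) / len(high)) / 2
        if abs(new_t - t) < eps:
            return new_t
        t = new_t


def binarize(pixels, t, max_val=255):
    """Map pixels above the threshold to paper (max_val), the rest to ink (0)."""
    return [max_val if p > t else 0 for p in pixels]


def mse(a, b):
    """Mean squared error between two equally sized pixel sequences."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


def psnr(a, b, max_val=255):
    """Peak signal-to-noise ratio in dB; infinite for identical inputs."""
    err = mse(a, b)
    return math.inf if err == 0 else 10 * math.log10(max_val ** 2 / err)


# Hypothetical flattened grayscale patch: dark ink and bright, noisy paper.
noisy = [12, 20, 35, 28, 200, 215, 230, 190, 40, 220]
t = iterative_global_threshold(noisy)
clean = binarize(noisy, t)
print("threshold:", t)
print("MSE:", mse(noisy, clean), "PSNR (dB):", psnr(noisy, clean))
```

A lower MSE (or higher PSNR) between the original and the cleaned image suggests that less of the original intensity information was altered by thresholding; whether that correlates with OCR readability is a separate question that such metrics alone do not settle.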
Similar resources
Quality of Hospital Bed Performance Studies based on Pabon Lasso Model
Hospital bed productivity has a remarkable effect on health system performance. The Pabon Lasso Model (PLM) is a useful tool for evaluating inpatient bed performance, and there is a growing trend in the use of this technique in hospital performance evaluation. The aim of this study is to review the literature on PLM to gain insight into the quality of the results of these studies. By adopting a syste...
An Analysis of Quantitative Aspects in the Evaluation of Thematic Segmentation Algorithms
We consider here the task of linear thematic segmentation of text documents, by using features based on word distributions in the text. For this task, a typical and often implicit assumption in previous studies is that a document has just one topic and therefore many algorithms have been tested and have shown encouraging results on artificial data sets, generated by putting together parts of di...
Text Summarization Using Cuckoo Search Optimization Algorithm
Today, with the rapid growth of the World Wide Web and the creation of Internet sites and online text resources, text summarization has attracted the attention of many researchers. Extractive text summarization is an important summarization method that consists of selecting the top representative sentences from the input document. When we are facing large-volume documents, the extr...
Unsupervised Methods of Topical Text Segmentation for Polish
This paper describes a study on performance of existing unsupervised algorithms of text documents topical segmentation when applied to Polish plain text documents. For performance measurement five existing topical segmentation algorithms were selected, three different Polish test collections were created and seven approaches to text preprocessing were implemented. Based on quantitative results ...
Summarization Evaluation: Correlating Human Performance on an Extrinsic Task with Automatic Intrinsic Metrics
Title of dissertation: Text Summarization Evaluation: Correlating Human Performance on an Extrinsic Task with Automatic Intrinsic Metrics. Stacy F. Hobson, Doctor of Philosophy, 2007. Dissertation directed by: Professor Bonnie J. Dorr, Department of Computer Science. Text summarization evaluation is the process of assessing the quality of an individual summary produced by human or automatic methods....
Publication date: 2015